Is it better to combine predictions?
Abstract
We have compared the accuracy of the individual protein secondary structure prediction methods PHD, DSC, NNSSP and Predator against the accuracy obtained by combining the predictions of the methods. A range of ways of combining predictions were tested: voting, biased voting, linear discrimination, neural networks and decision trees. The combined methods that involve 'learning' (the non-voting methods) were trained using a set of 496 non-homologous domains; this dataset was biased, as some of the secondary structure prediction methods had used these domains for training. We used two independent test sets to compare predictions: the first consisted of 17 non-homologous domains from CASP3 (Third Community Wide Experiment on the Critical Assessment of Techniques for Protein Structure Prediction); the second (the EBI test set) consisted of 405 domains that were selected in the same way as the training set, and were non-homologous to each other and to the training set. On both test datasets the most accurate individual method was NNSSP, followed by PHD and DSC, and the least accurate was Predator; however, it was not possible to show conclusively a significant difference between the individual methods. Comparing the accuracy of the single methods with that obtained by combining predictions, it was found that it was better to use a combination of predictions. On both test datasets it was possible to obtain an approximately 3% improvement in accuracy by combining predictions. In most cases the combined methods were statistically significantly better (at P = 0.05 on the CASP3 test set, and P = 0.01 on the EBI test set). On the CASP3 test dataset there was no significant difference in accuracy between any of the combined methods of prediction; on the EBI test dataset, linear discrimination and neural networks significantly outperformed voting techniques. We conclude that it is better to combine predictions.
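The abstract names voting and biased voting but does not spell out the rules used, so the following is only a minimal sketch of per-residue voting under stated assumptions: each method emits a string over a three-state alphabet (H, E, C), "biased" voting is modelled as a weighted vote, and ties go to the state proposed by the earliest-listed method. The function name `weighted_vote` and the example weights are hypothetical and are not taken from the paper.

```python
from collections import defaultdict


def weighted_vote(predictions, weights=None):
    """Combine equal-length secondary structure strings residue by residue.

    predictions: one string per method (e.g. over 'H', 'E', 'C').
    weights: optional per-method weights (hypothetical; equal weights
             reduce this to a plain majority vote).
    Ties are broken in favour of the state that appears first in method
    order, so methods listed earlier win ties.
    """
    if not predictions:
        raise ValueError("need at least one prediction")
    if len({len(p) for p in predictions}) != 1:
        raise ValueError("all predictions must have the same length")
    if weights is None:
        weights = [1.0] * len(predictions)
    if len(weights) != len(predictions):
        raise ValueError("need exactly one weight per prediction")

    combined = []
    for column in zip(*predictions):
        scores = defaultdict(float)
        for state, weight in zip(column, weights):
            scores[state] += weight
        # max() returns the first key with the top score; dict insertion
        # order follows method order, which gives the tie-break rule above.
        combined.append(max(scores, key=scores.get))
    return "".join(combined)


# Four made-up predictions for a five-residue fragment (illustrative only).
example = ["HHHEC", "HHEEC", "CHEEC", "CCEEH"]
plain = weighted_vote(example)
biased = weighted_vote(example, weights=[1.0, 1.0, 1.0, 2.5])
```

With equal weights the first residue is a 2-2 tie between H and C and resolves to H because the first method proposed it; giving the last method a weight of 2.5 flips that residue to C. The learned combiners mentioned in the abstract (linear discrimination, neural networks, decision trees) would replace this fixed rule with one fitted on training data.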
Journal:
- Protein Engineering
Volume 13, Issue 1
Pages: -
Publication date: 2000